Goto

Collaborating Authors

 Darlington County


'We had people come just to see it': Amazon delivers its first UK parcels by drone

BBC News

'We had people come just to see it': Amazon delivers its first UK parcels by drone Amazon has become the first retailer in the UK to start a drone delivery service with a limited launch in Darlington, County Durham. Packages weighing less than 5lb (2.2kg) and containing everyday items such as beauty products, batteries and cables are now being delivered within a 7.5 mile (12km) radius of Amazon's fulfilment centre. The tech giant is convinced there is demand for ultra-fast deliveries and hopes to slowly expand the service. Rob Shield let Amazon use an Airbnb on his farm for its first test runs. Initially it was a novelty, so we were ordering everything under the sun, he says.


Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control

arXiv.org Artificial Intelligence

Pretrained language models have demonstrated extraordinary capabilities in language generation. However, real-world tasks often require controlling the distribution of generated text in order to mitigate bias, promote fairness, and achieve personalization. Existing techniques for controlling the distribution of generated text only work with quantified distributions, which require pre-defined categories, proportions of the distribution, or an existing corpus following the desired distributions. However, many important distributions, such as personal preferences, are unquantified. In this work, we tackle the problem of generating text following arbitrary distributions (quantified and unquantified) by proposing Nano, a few-shot human-in-the-loop training algorithm that continuously learns from human feedback. Nano achieves state-of-the-art results on single topic/attribute as well as quantified distribution control compared to previous works. We also show that Nano is able to learn unquantified distributions, achieves personalization, and captures differences between different individuals' personal preferences with high sample efficiency.